Optimal on-line Bayesian model selection for speaker adaptation

نویسندگان

  • Shaojun Wang
  • Yunxin Zhao
چکیده

In this paper, we show how to accomodate a Bayesian variant of Rissanen’s MDL into on-line Bayesian adaptation to control both model structural complexity and parameterization complexity to best fit an available amount of adaptation data, the goal being minimization of resulting recognition error. An efficient bottom-up dynamic programming based pruning algorithm is developed for selecting models using the MDL principle. Speaker adaptation experiments using a 26-letter English alphabet vocabulary were conducted and the proposed Bayesian variant MDL method is shown to provide an optimal tradeoff between recognition accuracy and complexity of model structure and parameterization over a full range of adaptation data size. It in general is capable of automaticlly selecting a set of model parameters that leads to best recognition performance for a given amount of adaptation data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker model selection using Bayesian information criterion for speaker indexing and speaker adaptation

This paper addresses unsupervised speaker indexing for discussion audio archives. We propose a flexible framework that selects an optimal speaker model (GMM or VQ) based on the Bayesian Information Criterion (BIC) according to input utterances. The framework makes it possible to use a discrete model when the data is sparse, and to seamlessly switch to a continuous model after a large cluster is...

متن کامل

Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

This paper presents a new recursive Bayesian learning approach for transformation parameter estimation in speaker adaptation. Our goal is to incrementally transform or adapt a set of hidden Markov model (HMM) parameters for a new speaker and gain large performance improvement from a small amount of adaptation data. By constructing a clustering tree of HMM Gaussian mixture components, the linear...

متن کامل

Unsupervised speaker indexing using speaker model selection based on Bayesian information criterion

This paper addresses unsupervised speaker indexing for discussion audio archives. In discussions, the speaker changes frequently, thus the duration of utterances is very short and its variation is large, which causes significant problems in applying conventional methods such as model adaptation and VarianceBIC (Bayesian Information Criterion) methods. We propose a flexible framework that select...

متن کامل

Long term on-line speaker adaptation for large vocabulary dictation

On-line speaker adaptation is desirable for speech recognition dictation applications, because it o ers the possibility to improve the system with the speaker-speci c data obtained from the user. Since the user will work with such a device over a long period, for a dictation system the long term adaptation performance is more important than the adaptation speed. In contrast to speaker-dependent...

متن کامل

On-line hierarchical transformation of hidden Markov models for speaker adaptation

This paper presents a novel framework of on-line hierarchical transformation of hidden Markov models (HMM’s) for speaker adaptation. Our aim is to incrementally transform (or adapt) all the HMM parameters to a new speaker even though part of HMM units are unseen in adaptation data. The transformation paradigm is formulated according to the approximate Bayesian estimate, which the prior statisti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000